System and method for automatically transcribing voicemail

ABSTRACT

Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for automatically transcribing voicemail. The method includes receiving a plurality of voicemail messages from callers, identifying for each voicemail message in the plurality of voicemail messages a first frequency with which the respective caller leaves voicemails, identifying for each voicemail message in the plurality of voicemail messages a second frequency with which a user requests transcription of each voicemail, assigning a priority ranking to each voicemail message in the plurality of voicemail messages based on the respective first frequency and the respective second frequency, and transcribing untranscribed voicemail messages with a highest priority ranking. The method can include establishing a priority ranking threshold and repeatedly transcribing a next highest ranking untranscribed voicemail message until no further untranscribed voicemail messages remain above the priority ranking threshold.

PRIORITY INFORMATION

The present application is a continuation of U.S. patent applicationSer. No. 12/328,370, filed Dec. 4, 2008, the content of which isincorporated herein by reference in its entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to voicemail and more specifically toautomatically transcribing voicemails according to priority.

2. Introduction

Telecommunications companies often provide voicemail services forcustomers. A typical voicemail service allows callers to record voicemessages for later playback by the telephone customer. One specializedfeature of voicemail service is transcribing voicemails so thattelephone customers can understand messages more clearly, read messagesfaster, and even copy and paste information from the message. However,transcribing each stored voicemail is an enormous computational taskeven for high powered mainframes or clusters of computers because of thecrushingly high volume of voicemails recorded every day. For example, ifa telecommunications company has 200,000 customers with voicemail boxes,and only 25% of those receive an average of 2 voicemails per day thatare an average of 45 seconds long, the voicemail system receives roughly1250 hours' (just over 52 days') worth of voicemail per day. Thus, forthis very modest scenario, a transcription computing device would needto transcribe emails roughly 52 times faster than real time in order tokeep up with received voicemails. Installing, configuring, maintaining,and powering computer systems sufficiently powerful to transcribe andstore all received voicemails in advance is expensive and timeconsuming.

In addition, transcription is not needed in every case. Only a subset oftelephone customers request transcriptions. The remaining unrequestedtranscriptions are essentially wasted and provide little or no benefitfor telephone customers at a great burden to the telecommunicationscompany.

One approach in the art to solve these problems is to transcribevoicemails on demand. On-demand transcriptions solve the problem ofwasted computational effort performing unnecessary transcriptions, butthey have an additional problem of being slow. On-demand transcriptionscan perform somewhat faster with less computation, but they trade speedfor reduced accuracy. Accordingly, what is needed in the art is animproved way to transcribe voicemails.

SUMMARY

Additional features and advantages of the invention will be set forth inthe description which follows, and in part will be obvious from thedescription, or may be learned by practice of the invention. Thefeatures and advantages of the invention may be realized and obtained bymeans of the instruments and combinations particularly pointed out inthe appended claims. These and other features of the present inventionwill become more fully apparent from the following description andappended claims, or may be learned by the practice of the invention asset forth herein.

Disclosed are systems, computer-implemented methods, and tangiblecomputer-readable media for automatically transcribing voicemail. Themethod includes receiving a plurality of voicemail messages fromcallers, identifying for each voicemail message in the plurality ofvoicemail messages a first frequency with which the respective callerleaves voicemails, identifying for each voicemail message in theplurality of voicemail messages a second frequency with which a userrequests transcription of each voicemail, assigning a priority rankingto each voicemail message in the plurality of voicemail messages basedon the respective first frequency and the respective second frequency,and transcribing untranscribed voicemail messages with a highestpriority ranking. In one aspect, the method repeatedly transcribes thenext highest ranking untranscribed voicemail messages until no furtheruntranscribed voicemail messages remain. The method can includeestablishing a priority ranking threshold and repeatedly transcribing anext highest ranking untranscribed voicemail message until no furtheruntranscribed voicemail messages remain above the priority rankingthreshold. In one variation, the method further generatesspeaker-specific syllable libraries while transcribing voicemailmessages, receives incoming text-based messages from a speaker, andreads aloud the incoming text-based message using the speaker-specificsyllable library associated with the speaker.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to describe the manner in which the above-recited and otheradvantages and features of the invention can be obtained, a moreparticular description of the invention briefly described above will berendered by reference to specific embodiments thereof which areillustrated in the appended drawings. Understanding that these drawingsdepict only exemplary embodiments of the invention and are not thereforeto be considered to be limiting of its scope, the invention will bedescribed and explained with additional specificity and detail throughthe use of the accompanying drawings in which:

FIG. 1 illustrates an example system embodiment;

FIG. 2 illustrates an example method embodiment; and

FIG. 3 illustrates an example voicemail transcription system.

DETAILED DESCRIPTION

Various embodiments of the invention are discussed in detail below.While specific implementations are discussed, it should be understoodthat this is done for illustration purposes only. A person skilled inthe relevant art will recognize that other components and configurationsmay be used without parting from the spirit and scope of the invention.

With reference to FIG. 1, an exemplary system includes a general-purposecomputing device 100, including a processing unit (CPU) 120 and a systembus 110 that couples various system components including the systemmemory such as read only memory (ROM) 140 and random access memory (RAM)150 to the processing unit 120. Other system memory 130 may be availablefor use as well. It can be appreciated that the invention may operate ona computing device with more than one CPU 120 or on a group or clusterof computing devices networked together to provide greater processingcapability. A processing unit 120 can include a general purpose CPUcontrolled by software as well as a special-purpose processor. An IntelXeon LV L7345 processor is an example of a general purpose CPU which iscontrolled by software. Particular functionality may also be built intothe design of a separate computer chip. An STMicroelectronics STA013processor is an example of a special-purpose processor which decodes MP3audio files. Of course, a processing unit includes any general purposeCPU and a module configured to control the CPU as well as aspecial-purpose processor where software is effectively incorporatedinto the actual processor design. A processing unit may essentially be acompletely self-contained computing system, containing multiple cores orCPUs, a bus, memory controller, cache, etc. A multi-core processing unitmay be symmetric or asymmetric.

The system bus 110 may be any of several types of bus structuresincluding a memory bus or memory controller, a peripheral bus, and alocal bus using any of a variety of bus architectures. A basicinput/output (BIOS) stored in ROM 140 or the like, may provide the basicroutine that helps to transfer information between elements within thecomputing device 100, such as during start-up. The computing device 100further includes storage devices such as a hard disk drive 160, amagnetic disk drive, an optical disk drive, tape drive or the like. Thestorage device 160 is connected to the system bus 110 by a driveinterface. The drives and the associated computer readable media providenonvolatile storage of computer readable instructions, data structures,program modules and other data for the computing device 100. In oneaspect, a hardware module that performs a particular function includesthe software component stored in a tangible computer-readable medium inconnection with the necessary hardware components, such as the CPU, bus,display, and so forth, to carry out the function. The basic componentsare known to those of skill in the art and appropriate variations arecontemplated depending on the type of device, such as whether the deviceis a small, handheld computing device, a desktop computer, or a computerserver.

Although the exemplary environment described herein employs the harddisk, it should be appreciated by those skilled in the art that othertypes of computer readable media which can store data that areaccessible by a computer, such as magnetic cassettes, flash memorycards, digital versatile disks, cartridges, random access memories(RAMs), read only memory (ROM), a cable or wireless signal containing abit stream and the like, may also be used in the exemplary operatingenvironment.

To enable user interaction with the computing device 100, an inputdevice 190 represents any number of input mechanisms, such as amicrophone for speech, a touch-sensitive screen for gesture or graphicalinput, keyboard, mouse, motion input, speech and so forth. The input maybe used by the presenter to indicate the beginning of a speech searchquery. The device output 170 can also be one or more of a number ofoutput mechanisms known to those of skill in the art. In some instances,multimodal systems enable a user to provide multiple types of input tocommunicate with the computing device 100. The communications interface180 generally governs and manages the user input and system output.There is no restriction on the invention operating on any particularhardware arrangement and therefore the basic features here may easily besubstituted for improved hardware or firmware arrangements as they aredeveloped.

For clarity of explanation, the illustrative system embodiment ispresented as comprising individual functional blocks (includingfunctional blocks labeled as a “processor”). The functions these blocksrepresent may be provided through the use of either shared or dedicatedhardware, including, but not limited to, hardware capable of executingsoftware and hardware, such as a processor, that is purpose-built tooperate as an equivalent to software executing on a general purposeprocessor. For example the functions of one or more processors presentedin FIG. 1 may be provided by a single shared processor or multipleprocessors. (Use of the term “processor” should not be construed torefer exclusively to hardware capable of executing software.)Illustrative embodiments may comprise microprocessor and/or digitalsignal processor (DSP) hardware, read-only memory (ROM) for storingsoftware performing the operations discussed below, and random accessmemory (RAM) for storing results. Very large scale integration (VLSI)hardware embodiments, as well as custom VLSI circuitry in combinationwith a general purpose DSP circuit, may also be provided.

The logical operations of the various embodiments are implemented as:(1) a sequence of computer implemented steps, operations, or proceduresrunning on a programmable circuit within a general use computer, (2) asequence of computer implemented steps, operations, or proceduresrunning on a specific-use programmable circuit; and/or (3)interconnected machine modules or program engines within theprogrammable circuits.

Having disclosed some fundamental system components, the disclosureturns to the exemplary method embodiment for transcribing voicemail asillustrated in FIG. 2. For clarity, the method is discussed in terms ofa system configured to practice the method. The system receives aplurality of voicemail messages from callers (202). The system can be avoicemail server for a telecommunications company or a small business.The system can receive voicemail messages left through a telephone aswell as through email or voice over IP (VoIP). In one typical scenario,callers attempt to dial a telephone company customer, but the customeris unavailable or does not answer the telephone call. The server promptsthe caller to leave a message and records an audio message from thecaller for the customer.

The system identifies two frequencies upon which to base a priority foreach voicemail. First, the system identifies for each voicemail messagein the plurality of voicemail messages a first frequency with which therespective caller leaves voicemails (204). This frequency indicates howoften this particular caller leaves voicemails. Second, the systemidentifies for each voicemail message in the plurality of voicemailmessages a second frequency with which a user requests transcription ofeach voicemail (206). This frequency indicates how often the telephonecustomer requests transcription, whether explicitly or otherwise. Thesystem can either calculate these frequencies anew for each voicemail orthe system can maintain a table showing how often each user requestsvoicemail transcriptions and/or how often each caller leaves voicemails.The first time the system transcribes a voicemail from a caller, thetranscription can be slow and in real time if the system has nottranscribed voicemail from that caller before. The system canpreemptively transcribe later messages from the same caller because eachtranscription increases the ranking and/or the frequency for thatcaller.

The system assign a priority ranking to each voicemail message in theplurality of voicemail messages based on the respective first frequencyand the respective second frequency (208). In one embodiment, thepriority ranking is a number, with higher numbers being high priorityand lower numbers being low priority, or vice versa. Depending on thedesired granularity, the system can allocate more or less bits to storethe priority ranking. For example, if the system allocates 4 bits, thesystem can store 2⁴=16 priority ranking levels. The priority rankingreflects the two frequencies above. For example, if one customerrequests voicemail transcriptions for every single voicemail message,the system assigns a very high priority ranking to voicemails in thatcustomer's voicemail box. The system prioritizes voicemails in thesystem using the priority ranking in order to determine which voicemailsmost likely require transcription. In one aspect, the system assignspriority further based on a user profile. For example, a customer canset up preferences for the system to transcribe every voicemail receivedbetween 9 a.m. and 5 p.m. on Tuesdays and Thursdays, no voicemails fromunknown callers, or only those voicemails older than 24 hours.

The system can keep track of untranscribed voicemails with a priorityqueue data structure storing pointers to respective voicemails in adatabase. A traditional queue data structure pushes new objects on theback of the queue and pops objects off the front of the queue. Atraditional queue processes objects “first in, first out”, meaning thatobjects that are in the queue the longest are processed first. Apriority queue processes objects not based on time in the queue, butrather on priority ranking. The system can take in to account othervariables in addition to the two frequencies when assigning priorityrankings to voicemails, such as time in the queue, whether or not thevoicemail contains information which would be useful to copy and paste(such as an home address, email address, or phone number), or listeningto voicemail on a device where copying and pasting is available (such asan Apple iPhone or a personal computer).

The system transcribes untranscribed voicemail messages with a highestpriority ranking (210). This process can be called anticipatorypreemptive transcription. Once the system determines the highest valuevoicemails, the system transcribes the highest priority voicemailsfirst. The system can store transcribed voicemail messages for laterretrieval. In one embodiment, the system repeatedly transcribes the nexthighest ranking untranscribed voicemail messages until no furtheruntranscribed voicemail messages remain. In this way, the systemrepeatedly processes the queue until the queue is empty and allvoicemail messages are transcribed. The system devotes processingresources to high priority voicemail messages as they are received andworks through the lower priority ones when the high priority ones arecompleted. In yet another variation, the system establishes a priorityranking threshold, and repeatedly transcribes a next highest rankinguntranscribed voicemail message until no further untranscribed voicemailmessages remain above the priority ranking threshold. The system candetermine, for example, that the chances of needing a transcription ofvoicemail messages below a certain priority ranking are so low that thesystem can safely ignore them. The principles described herein can beused in combination with a translation engine to translatetranscriptions into different languages.

The system can further build a voice model of a respective caller whiletranscribing voicemail messages and transcribe additional voicemailmessages from the respective caller with the voice model. A tailoredvoice model of a specific caller enables the system to recognize speechfaster and with greater accuracy. Voice models can be tied to a specificcaller with a unique identifier such as a telephone number.

In one variation, the system further generates speaker-specific syllablelibraries while transcribing voicemail messages, receives incomingtext-based messages, such as emails, text messages, and instantmessages, from a speaker, and reads aloud the incoming text-basedmessage using the speaker-specific syllable library associated with thespeaker. In this way, the system can extract a supplementary benefitfrom the computational work done to transcribe voicemails. Emails andtext messages can sound more personal when the system plays them in thevoice of the sender instead of a generic synthetic voice. This variationcan be useful for listening to email over a traditional phone, a cellphone without an Internet connection, an Internet-enabled cell phonewith a screen too small to comfortably read email, or over a Bluetoothheadset when driving a vehicle.

FIG. 3 illustrates an example voicemail transcription system 300. Thesystem 300 receives voicemails from callers 302. Callers can leavevoicemails through cellular phones, voice over IP (VoIP) phones,landline phones, and even computers. The voicemail server 304 receivesthe voicemails. The voicemail server keeps track of how often aparticular caller leaves voicemails and how often called parties requesttranscription. The server 304 uses these two variables to determine apriority ranking 308 for each voicemail. The priority ranking can be asingle number that reflects the two-dimensions of transcriptionfrequency and caller frequency. For instance, the system can assign ahigh priority ranking 308 to a voicemail addressed to a user whotranscribes 89% of his incoming voicemails. The server 304 pushesincoming voicemails on an untranscribed voicemail priority queue 306 inorder of priority.

In some system configurations, voicemail messages with the lowestpriority rankings may never get transcribed because of limited systemresources. Furthermore, because many of those low-priority voicemailmessages come from unknown numbers, the customer might never explicitlyrequest a transcription. Accordingly, such voicemails can sit at thebottom of the queue forever since high-priority voicemail messages keepcoming. To remedy this issue, the system can include a “periodic wordsspotting” module. The periodic words spotting module can operate onvoicemail messages which have been in the queue for some thresholdperiod of time, such as 24 hours, 2 days, or 1 week. The periodic wordsspotting module can operate according to the following steps. First, theperiodic words spotting module stamps every voicemail message in thequeue to indicate the date and time the system inserted the voicemailmessage into the queue. Second, the periodic words spotting modulestamps every voicemail message in the queue with a waiting stamp toindicate a future time at which the system will take an action for thisvoicemail message. Third, the periodic words spotting module scans thewaiting stamp of all voicemail messages in the queue at an off-peakhour, such as 2 a.m. If the date indicated by the waiting stamp haspassed for a voicemail message, the system selects that voicemailmessage. If multiple waiting stamp dates have passed, the system selectsvoicemail messages in the order of their entry stamps. If multiplevoicemails have the same waiting stamp, the system selects those thathave never been transcribed first. If still more items match theselection sequence, the system selects them in a random order.

Fourth, the periodic words spotting module selects the key words from aprofile of personal interests constructed for the person or a group ofpersons who are the recipients of the incoming voicemail messages. Thisprofile of personal interests can contain a list of words and/or phrasesunique to their life, conversations, or business such as school names,city names, client names, certain dates, product names, etc. Fifth, theperiodic words spotting module performs a fast scan of the entirevoicemail message for each selected message in order to spot wordsand/or phrases in the profile of personal interests to produce a chainof words spotted in the voicemail message in a sequence they are found.Some example chains of words are “Central High, Springfield, 1984”,“Barcelona, Spain”, and “soccer, fishing, Yellowstone”. In a sense, thesystem transcribes this type of low-priority voicemail message as aselected list of words rather than a word-by-word level as the systemcurrently does for other high-priority voicemail messages. In this way,the system performs a shortened transcription of the lower priorityvoicemail which uses a fraction of the system resources required for anormal full-text transcription.

The profile of personal interests for the customer can be created frommultiple sources, such as interests explicitly defined by the customer,such as through an automated interview process, and interests extractedbased on high frequency words in user communications, such asvoicemails, text messages, or emails, over a certain time period, suchas the last 12 months.

A transcriber module 310 pops the highest priority message from the topof the priority queue 306 and transcribes it. The transcriber 310 storestranscribed voicemails 312, which can include voicemail and text or justtext. The transcriber 310 can record speaker-specific syllable libraries316, effectively storing the building blocks for generating speech inthe voice of the caller. The voicemail server 304 retrieves transcribedvoicemails 312 when requested by customers 314. If a customer requests atranscription which is unavailable, the system can insert the voicemailwith a maximum priority in the priority queue 306 so the transcriber 310processes the requested transcription as soon as possible.

In another aspect, the voicemail server 304 also receives text-basedmessages for customers, such as email or short message service (SMS)messages (commonly known as text messages). The server 304 cansynthesize speech using the speaker-specific syllable library 316 forthe sender of the message to read aloud the text-based message in thevoice of the sender. As an example, Mike leaves Nathan a voicemail. Thesystem transcribes Mike's voicemail and generates a syllable library forMike. Later, Mike sends Nathan (or someone else) an email. The systemcan generate speech which reads the email aloud in Mike's voice based onMike's syllable library.

In one aspect, the system can further shorten, compress, or summarizetext and voice messages. The system can remove certain words withoutdetracting from the overall meaning of the message or replace certainwords with something more concise. The system can replace “filler” wordssuch as um, like, you know, uh, really, and just. For instance, thesystem can compress a voicemail message “Harry, it was great to see you.Let's get together. I think we should go to dinner later” to say “Harry. . . great to see you . . . Let's . . . go to dinner later”. Anotherexample is a voicemail message where the caller repeats the phrase“interest rates” more than ten times in 40 seconds. The system canabbreviate the entire message to just the key word or phrase, “interestrates”. The system can compress text-based messages by substitutingcommonly used acronyms and/or abbreviations, such as GTG (got to go),IANAL (I am not a lawyer), IM (instant message), and FWIW (for what it'sworth). The system can maintain or download updated acronyms and/orabbreviations. The system can also generate acronyms based on usage.When the system encounters unrecognized acronyms, the system can guessits meaning based on context and/or the system can ask the sender and/orthe receiver what the unrecognized acronym means. With a centralizedsystem for processing text and voice messages, when one personintroduces or updates an unrecognized acronym, the system can then usethe new acronym for all users. In one embodiment where the systemdisplays emails on a computer screen, the system can make shortened textexpandable to the original text upon some user input, such as a click,speech input, or mouse gesture. These message compression techniques canalso be applied to web pages, bulletin boards, or other bodies of textas well.

Embodiments within the scope of the present invention may also includecomputer-readable media for carrying or having computer-executableinstructions or data structures stored thereon. Such computer-readablemedia can be any available media that can be accessed by a generalpurpose or special purpose computer, including the functional design ofany special purpose processor as discussed above. By way of example, andnot limitation, such computer-readable media can comprise RAM, ROM,EEPROM, CD-ROM or other optical disk storage, magnetic disk storage orother magnetic storage devices, or any other medium which can be used tocarry or store desired program code means in the form ofcomputer-executable instructions, data structures, or processor chipdesign. When information is transferred or provided over a network oranother communications connection (either hardwired, wireless, orcombination thereof) to a computer, the computer properly views theconnection as a computer-readable medium. Thus, any such connection isproperly termed a computer-readable medium. Combinations of the aboveshould also be included within the scope of the computer-readable media.

Computer-executable instructions include, for example, instructions anddata which cause a general purpose computer, special purpose computer,or special purpose processing device to perform a certain function orgroup of functions. Computer-executable instructions also includeprogram modules that are executed by computers in stand-alone or networkenvironments. Generally, program modules include routines, programs,objects, components, data structures, and the functions inherent in thedesign of special-purpose processors, etc. that perform particular tasksor implement particular abstract data types. Computer-executableinstructions, associated data structures, and program modules representexamples of the program code means for executing steps of the methodsdisclosed herein. The particular sequence of such executableinstructions or associated data structures represents examples ofcorresponding acts for implementing the functions described in suchsteps.

Those of skill in the art will appreciate that other embodiments of theinvention may be practiced in network computing environments with manytypes of computer system configurations, including personal computers,hand-held devices, multi-processor systems, microprocessor-based orprogrammable consumer electronics, network PCs, minicomputers, mainframecomputers, and the like. Embodiments may also be practiced indistributed computing environments where tasks are performed by localand remote processing devices that are linked (either by hardwiredlinks, wireless links, or by a combination thereof) through acommunications network. In a distributed computing environment, programmodules may be located in both local and remote memory storage devices.

The various embodiments described above are provided by way ofillustration only and should not be construed to limit the invention.For example, the principles herein may be applied to voicemail in thetraditional telephone company sense as well as voicemail left asattachments in an email inbox. Those skilled in the art will readilyrecognize various modifications and changes that may be made to thepresent invention without following the example embodiments andapplications illustrated and described herein, and without departingfrom the true spirit and scope of the present invention.

We claim:
 1. A method comprising: receiving a plurality of voicemailmessages; assigning, via a processor, a priority to each voicemailmessage in the plurality of voicemail messages, the priority based on acall frequency by a caller associated with each voicemail message and auser transcription request frequency, to yield prioritized untranscribedvoicemail messages; determining a highest ranked priority voicemail inthe prioritized untranscribed voicemail messages; and transcribing thehighest ranked priority voicemail.
 2. The method of claim 1, furthercomprising repeatedly transcribing a next highest priority untranscribedvoicemail message until no further untranscribed voicemail messagesremain.
 3. The method of claim 1, further comprising: establishing apriority ranking threshold; and repeatedly transcribing a next highestpriority voicemail message until no further untranscribed voicemailmessages remain above the priority ranking threshold.
 4. The method ofclaim 1, further comprising building a voice model of a respectivecaller while transcribing voicemail messages.
 5. The method of claim 1,wherein assigning the priority to each voicemail message is furtherbased on a user profile.
 6. The method of claim 1, further comprisinggenerating speaker-specific syllable libraries while transcribing theuntranscribed voicemail messages.
 7. The method of claim 6, furthercomprising: receiving a text-based message from a speaker associatedwith a generated speaker-specific syllable library; and generating anaudible version of the text-based message based on the generatedspeaker-specific syllable library.
 8. A system comprising: a processor;and a computer-readable storage medium having instructions stored which,when executed by the processor, cause the processor to performoperations comprising: receiving a plurality of voicemail messages;assigning a priority to each voicemail message in the plurality ofvoicemail messages, the priority based on a call frequency by a callerassociated with each voicemail message and a user transcription requestfrequency, to yield prioritized untranscribed voicemail messages;determining a highest ranked priority voicemail in the prioritizeduntranscribed voicemail messages; and transcribing the highest rankedpriority voicemail.
 9. The system of claim 8, the computer-readablestorage medium having additional instructions stored which result inoperations comprising repeatedly transcribing a next highest priorityuntranscribed voicemail message until no further untranscribed voicemailmessages remain.
 10. The system of claim 8, the computer-readablestorage medium having additional instructions stored which result inoperations comprising: establishing a priority ranking threshold; andrepeatedly transcribing a next highest priority voicemail message untilno further untranscribed voicemail messages remain above the priorityranking threshold.
 11. The system of claim 8, the computer-readablestorage medium having additional instructions stored which in operationscomprising building a voice model of a respective caller whiletranscribing voicemail messages.
 12. The system of claim 8, whereinassigning the priority to each voicemail message is further based on auser profile.
 13. The system of claim 8, wherein the computer-readablestorage medium having additional instructions stored which result inoperations comprising generating speaker-specific syllable librarieswhile transcribing the untranscribed voicemail messages.
 14. The systemof claim 13, the computer-readable storage medium having additionalinstructions stored which result in operations comprising: receiving atext-based message from a speaker associated with a generatedspeaker-specific syllable library; and generating an audible version ofthe text-based message based at least in part on the generatedspeaker-specific syllable library.
 15. A computer-readable storagedevice having instructions stored which, when executed by a processor,cause the processor to perform operations comprising: receiving aplurality of voicemail messages; assigning a priority to each voicemailmessage in the plurality of voicemail messages, the priority based on acall frequency by a caller associated with each voicemail message and auser transcription request frequency, to yield prioritized untranscribedvoicemail messages; determining a highest ranked priority voicemail inthe prioritized untranscribed voicemail messages; and transcribing thehighest ranked priority voicemail.
 16. The computer-readable storagedevice of claim 15, having additional instructions which result inoperations comprising repeatedly transcribing a next highest priorityuntranscribed voicemail message until no further untranscribed voicemailmessages remain.
 17. The computer-readable storage device of claim 15,having additional instructions which result in operations comprising:establishing a priority ranking threshold; and repeatedly transcribing anext highest priority voicemail message until no further untranscribedvoicemail messages remain above the priority ranking threshold.
 18. Thecomputer-readable storage device of claim 15, having additionalinstructions which result in operations comprising building a voicemodel of a respective caller while transcribing voicemail messages.